An Overview of Discriminative Training for Speech Recognition
Author
Abstract
This paper gives an overview of discriminative training as it pertains to the speech recognition problem. The basic theory of discriminative training will be discussed and an explanation of maximum mutual information (MMI) given. Common problems inherent to discriminative training will be explored as well as practicalities associated with implementing discriminative training for large vocabulary recognition. Alternatives to the MMI objective function such as minimum word error (MWE) and minimum phone error (MPE) will be discussed. The application of discriminative techniques for adaptation will be described. Finally, possible future avenues of research will be given.
Similar resources
An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have demonstrated their performance in speech recognition systems, both for feature extraction and for acoustic modeling. In addition, CNNs have been used for robust speech recognition, and competitive results have been reported. A Convolutive Bottleneck Network (CBN) is a kind of CNN that has a bottleneck layer among its fully connected layers. The bottleneck fea...
Discriminative training for continuous speech recognition
Discriminative training techniques for Hidden Markov Models were recently proposed and successfully applied for automatic speech recognition. In this paper, a discussion of the Minimum Classification Error and the Maximum Mutual Information objectives is presented. An extended reestimation formula is used for the HMM parameter update for both objective functions. The discriminative training method...
Boosting Minimum Bayes Risk Discriminative Training
A new variant of AdaBoost is applied to a Minimum Bayes Risk discriminative training procedure that directly aims at reducing Word Error Rate for Automatic Speech Recognition. Both techniques try to improve the discriminative power of a classifier, and we show that they can be combined to yield even better performance on a small vocabulary continuous speech recognition task. Our results also...
An RNN based speech recognition system with discriminative training
Tan Lee and P.C. Ching (Department of Electronic Engineering) and L.W. Chan (Department of Computer Science), The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong. Abstract: In our previous work [1], a novel method of utilizing a set of fully connected recurrent neural networks (RNNs) for speech modeling has been proposed. Despi...
Structured Support Vector Machines for Speech Recognition
Discriminative training criteria and discriminative models are two effective improvements for HMM-based speech recognition. This thesis proposed a structured support vector machine (SSVM) framework suitable for medium to large vocabulary continuous speech recognition. An important aspect of structured SVMs is the form of features. Several previously proposed features in the field are summarized in ...